A Weakly Supervised Learning-Based Oversampling Framework for Class-Imbalanced Fault Diagnosis
نویسندگان
چکیده
With the lack of failure data, class imbalance has become a common challenge in fault diagnosis industrial systems. The oversampling methods can tackle class-imbalanced problem by generating minority samples to balance training set. However, one main challenges existing is how generate high-quality samples. Traditional regard all synthetic as ones be added set without filtering. low-quality would distort distribution dataset and worsen classification performance. In this article, we propose weakly supervised method that treats unlabeled develops graph semisupervised learning algorithm select samples, adding into final To improve quality cost-sensitive neighborhood component analysis dimensionality reduction enhance domain information validity high-dimensional datasets. Finally, combining boosting-based ensemble framework, new imbalanced framework suitable for high highly experimental validation performed on five real-world wind turbine blade cracking datasets compared 15 benchmark methods. results show average performances robustness proposed are significantly better than those
منابع مشابه
Oversampling for Imbalanced Learning Based on K-Means and SMOTE
Learning from class-imbalanced data continues to be a common and challenging problem in supervised learning as standard classification algorithms are designed to handle balanced class distributions. While different strategies exist to tackle this problem, methods which generate artificial data to achieve a balanced class distribution are more versatile than modifications to the classification a...
متن کاملOversampling Method for Imbalanced Classification
Classification problem for imbalanced datasets is pervasive in a lot of data mining domains. Imbalanced classification has been a hot topic in the academic community. From data level to algorithm level, a lot of solutions have been proposed to tackle the problems resulted from imbalanced datasets. SMOTE is the most popular data-level method and a lot of derivations based on it are developed to ...
متن کاملA Synthetic Minority Oversampling Method Based on Local Densities in Low-Dimensional Space for Imbalanced Learning
Imbalanced class distribution is a challenging problem in many real-life classification problems. Existing synthetic oversampling do suffer from the curse of dimensionality because they rely heavily on Euclidean distance. This paper proposed a new method, called Minority Oversampling Technique based on Local Densities in Low-Dimensional Space (or MOT2LD in short). MOT2LD first maps each trainin...
متن کاملImbalanced Multiple Noisy Labeling for Supervised Learning
When labeling objects via Internet-based outsourcing systems, the labelers may have bias, because they lack expertise, dedication and personal preference. These reasons cause Imbalanced Multiple Noisy Labeling. To deal with the imbalance labeling issue, we propose an agnostic algorithm PLAT (Positive LAbel frequency Threshold) which does not need any information about quality of labelers and un...
متن کاملSemi-Supervised Learning for Imbalanced Sentiment Classification
Various semi-supervised learning methods have been proposed recently to solve the long-standing shortage problem of manually labeled data in sentiment classification. However, most existing studies assume the balance between negative and positive samples in both the labeled and unlabeled data, which may not be true in reality. In this paper, we investigate a more common case of semi-supervised ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Reliability
سال: 2022
ISSN: ['1558-1721', '0018-9529']
DOI: https://doi.org/10.1109/tr.2021.3138448